Intron length distributions and gene prediction

نویسندگان

  • Scott William Roy
  • David Penny
چکیده

Accurate gene prediction in eukaryotes is a difficult and subtle problem. Here we point out a useful feature of expected distributions of spliceosomal intron lengths. Since introns are removed from transcripts prior to translation, intron lengths are not expected to respect coding frame, thus the number of genomic introns that are a multiple of three bases ('3n introns') should be similar to the number that are a multiple of three plus one bases (or plus two bases). Skewed predicted intron length distributions thus suggest systematic errors in intron prediction. For instance, a genome-wide excess of 3n introns suggests that many internal exonic sequences have been incorrectly called introns, whereas a deficit of 3n introns suggests that many 3n introns that lack stop codons have been mistaken for exonic sequence. A survey of genomic annotations for 29 diverse eukaryotic species showed that skew in intron length distributions is a common problem. We discuss several examples of skews in genome-wide intron length distributions that indicate systematic problems with gene prediction. We suggest that evaluation of length distributions of predicted introns is a fast and simple method for detecting a variety of possible systematic biases in gene prediction or even problems with genome assemblies, and discuss ways in which these insights could be incorporated into genome annotation protocols.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P-127: The Effect of Beta Globin Intron on Human FSH Hormone Expression in CHO Cells

Background Follicle stimulating hormone (FSH)- a hetrodimeric glycoprotein- is secreted by pituitary gland. This hormone stimulates growth and maturation of the follicles in females and sperms in male. Up to now, glycoprotein hormones such as FSH have produced in different cell lines. Among of the mammalian expression systems, the Chinese hamster ovary cells (CHO) have taken into consideration ...

متن کامل

Growth Hormone Gene Polymorphism in Two Iranian Native Fowls (Short Communication)

Biochemical polymorphism study is a method of determination of genetic variation. This variability could be a basis for selection and subsequent genetic improvement in farm animals. The polymorphism in the intron 1 of chicken growth hormone (cGH) gene was investigated in the Iranian native fowls by using polymerase chain reaction (PCR)-restriction fragment length polymorphism (RFLP) method. The...

متن کامل

Intron length increases oscillatory periods of gene expression in animal cells.

Introns may affect gene expression by increasing the time required to transcribe the gene. One way for extended transcription times to affect the behavior of a gene expression program is through a negative feedback loop. Here, we show that a logically engineered negative feedback loop in animal cells produces expression pulses, which have a broad time distribution that increases with intron len...

متن کامل

Comparative Evaluation of Intron Prediction Methods and Detection of Plant Genome Annotation Using Intron Length Distributions

Intron prediction is an important problem of the constantly updated genome annotation. Using two model plant (rice and Arabidopsis) genomes, we compared two well-known intron prediction tools: the Blast-Like Alignment Tool (BLAT) and Sim4cc. The results showed that each of the tools had its own advantages and disadvantages. BLAT predicted more than 99% introns of whole genomic introns with a sm...

متن کامل

Single Nucleotide Polymorphisms (SNPs) of GDF9 Gene in Bahmaei and Lak Ghashghaei Sheep Breeds and Its Association with Litter Size

Growth differentiation factor 9 (GDF9) belong to the superfamily of transforming growth factor β that is highly expressed in growing ovarian follicles of oocyte, and it has been strongly related to fecundity traits in sheep. Therefore, the GDF9 gene could serve as a genetic marker for improvement of reproductive performance in sheep. Therefore, the aim of this study was to invest...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2007